Attentive Feature Augmentation for Long-Tailed Visual Recognition

نویسندگان

چکیده

Deep neural networks have achieved great success on many visual recognition tasks. However, training data with a long-tailed distribution dramatically degenerates the performance of models. In order to relieve this imbalance problem, an effective Long-Tailed Visual Recognition (LTVR) framework is proposed based learned balance and robust features under circumstances. framework, plug-and-play Attentive Feature Augmentation (AFA) module designed mine class-related variation-related original samples via novel hierarchical channel attention mechanism. Then, those are aggregated synthesize fake cope dataset. Moreover, Lay-Back Learning Schedule (LBLS) developed ensure good initialization feature embedding. Extensive experiments conducted two-stage method verify effectiveness both learning classifier rebalancing in image task. Experimental results show that, when trained imbalanced datasets, achieves superior over state-of-the-art methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attentive Visual Recognition for Scene Exploration

Vision is an active process where behaviorally important information is selectively gathered. We present a model of scene exploration in which the high-resolution fovea is deployed to interesting regions by selective attention processes. The system switches between exploratory and recognition modes such that in exploration mode, regions of interest are investigated and in recognition mode, a gi...

متن کامل

Attentive Tracking Disrupts Feature Binding in Visual Working Memory.

One of the most influential theories in visual cognition proposes that attention is necessary to bind different visual features into coherent object percepts (Treisman & Gelade, 1980). While considerable evidence supports a role for attention in perceptual feature binding, whether attention plays a similar function in visual working memory (VWM) remains controversial. To test the attentional re...

متن کامل

Learning Convolutional Feature Hierarchies for Visual Recognition

We propose an unsupervised method for learning multi-stage hierarchies of sparse convolutional features. While sparse coding has become an increasingly popular method for learning visual features, it is most often trained at the patch level. Applying the resulting filters convolutionally results in highly redundant codes because overlapping patches are encoded in isolation. By training convolut...

متن کامل

Pre-attentive visual selection

In this note, we consider the critical function of the selection from an entire visual scene of visual locations or objects for detailed or attentive processing. Such selection of some places at the expense of others is necessary because attention has only a meagre capacity (estimated at just 40 bits/second; Sziklai 1956). The same bottleneck implies that the selection process itself cannot gen...

متن کامل

10 Attentive Visual

A panel of computer vision researchers recently convened and produced a document concerning the relatively new eld of \Active Vision" 36]. Although there is no precise deenition of what active vision is, most researchers would agree that active vision is concerned with controlling camera parameters, such as position, focal length, aperture width, and so on, in ways that serve to make vision pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology

سال: 2022

ISSN: ['1051-8215', '1558-2205']

DOI: https://doi.org/10.1109/tcsvt.2022.3161427